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Abstract 

o: 

£S| . We consider a type of nonnormal approximation of infinitely divisible distributions that 

; j ' incorporates compound Poisson, Gamma, and normal distributions. The approximation relies 

Oh, on achieving higher orders of cumulant matching, to obtain higher rates of approximation error 

•^^ ' decay. The parameters of the approximation are easy to fix. The computational complexity 

of random sampling of the approximating distribution in many cases is of the same order as 

fvq ' normal approximation. Error bounds in terms of total variance distance are derived. Both the 

univariate and the multivariate cases of the approximation are considered. 
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1 Introduction 



Simulation of infinitely divisible (i.d.) random variables has many applications. In most cases, 
since closed formulas of i.d. distributions are unavailable, good approximation methods are desired. 

^NJ ' Normal approximation of i.d. distributions, which was studied in [26] and later developed in [1, 11] 

in the framework of small jump approximation, has received much attention in the literature 
[2, 13, 16, 19, 22, 23, 33]. 

The idea of small jump normal approximation is as follows. Denote by A the Levy measure of an 

^T) ■ i-d. random variable X. Given r > 0, decompose A = A r + (A — A r ), such that A — A r > has finite 

mass. Correspondingly, X = X r + A r , where X r and A r are independent i.d. random variables 
with Levy measures A r and A — A r , respectively. Then X r is approximated by a Gaussian random 
variable, while A r is sampled using standard methods for compound Poisson random variables. 
5_i , Presumably, in order for the approximation to have a certain degree of precision, the support of 



> 



o 



ctf 



X r should be in a small neighborhood of 0. The size of the neighborhood is controlled by r. For 
the univariate case, it is natural to set r equal to the maximum jump size [1]. However, for the 
nrultivariate case, such use of r can be restrictive. Generally speaking, one can use r to index any 
tunable quantity, as long as it controls (indirectly) the size of the support of A r [11]. 

Normal approxinration relies on second-order nroment matching between X r and a Gaussian 
random variable, by which we mean the matching of their first and second moments. This is 
equivalent to second-order cumulant matching. Without specifying details, by certain measures, 
the error of the approximation in the univariate case is bounded by 



C\n\^x r /^l 



X T i 



where C is a universal constant, K2,x r is the second cumulant of X r , and for j > 3, M-,-^ = 
J \x\i\ r (dx) is the jth. "absolute cumulant" of X r [1]. The best currently available value of C 
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is 0.4785 ([28], p. 71). Also, in some symmetric cases, since the third cumulants of X r and the 
Gaussian random variable are 0, |ft|3,x r in the bound can be more or less replaced with |«|4.x- r > 
while the power of H2,x r is raised to 2 [1]. From the pattern of the bound, one could guess that, 
if X r and some Y r have the same cumulants of order 1, . . . , q — 1 with q > 5, then X r could be 
approximated by Y r with the error being bounded by 

C(r)(\K\ q , Xr + N gi y r )/«I;Jr P j 

where C(r) is a near-constant, at least when r is small, or most ideally, a universal constant which 
may depend on the dimension of X but is not very large. Elementary calculations indicates that 
in many cases, the above bound vanishes at a genuinely higher rate than the bound for normal 
approximation as r — > 0+. Since the qth. cumulant of a Gaussian random variable is 0, the above 
bound, if true, is consistent with the bound for normal approximation. 

Even by some rough analysis on characteristic functions, there is good reason to expect that 
the bound is true for both univariate and multivariate cases. However, before attempting to work 
out the detail, perhaps one should first ask if such an approximation can possibly be implemented 
easily. The meaning of the question is twofold. First, the distribution of the approximating random 
variable should be easy to identify; preferably, it is i.d. Second, the approximating random variable 
should be easy to sample; preferably, the computational complexity of the sampling is of the same 
order as the normal approximation. If the answer to the question is positive, then the next question 
is how large q can be. It can be anticipated that the larger q is, the faster the error of approximation 
vanishes as r — > 0+. After the answers to the two questions are in place, a wide range of available 
techniques can potentially be modified to establish the error bound (e.g. [3, 5, 8, 28]). 

Clearly, cumulant matching is equivalent to moment matching. Actually, our proof of the 
above type of bound eventually will be based on moment matching. However, thanks to the Levy- 
Khintchine representation, it is more natural and convenient to consider cumulants than moments. 
We shall show that it is fairly easy to construct approximating i.d. random variables with matching 
cumulants up to at least the fourth order, in other words, we can get at least q = 5. In many 
important cases, we can get q = 6, and in the symmetric cases, we can get q = 10. For the univariate 
case, the construction is particularly simple. The approximating i.d. random variable is the sum 
of a compound Poisson random variable and an independent Gaussian random variable, with the 
former in turn being the sum of a Poisson number of i.i.d. Gamma random variables. Importantly, 
using algorithms already available [14, 20], the computational complexity of the random sampling 
for the approximation is universally bounded, so it is of the same order as the random sampling 
from a normal distribution. 

We shall refer to the approximation as Poisson-Gamma-Normal (PGN) approximation, although 
a longer name like "compound Poisson-Normal approximation with Gamma summands and higher 
order of cumulant matching" might be more appropriate. We shall bound its error in terms 
of total variation distance by combining Fourier analysis, Lindeberg method (cf. [0] for a modern 
application of it) , and a device in [ I ] . The results are nonasymptotic and of the aforementioned type. 
Asymptotically, when applied to X r , the approximation yields substantially higher rate of precision 
than normal approximation as r — > 0+. Of course, on modern treatments of Poisson, compound 
Poisson, and normal approximations, there is now an extensive literature, and on Gamma and 
other types of approximations, there is also a growing literature; see [3, 7, 8, 17, 27, 28, 30] and 
references therein. However, it appears that there has been little work on using convolutions of 
different types of simple distributions to improve approximation, in the sense that the error of 
approximation vanishes at a faster rate asymptotically. 

For the multivariate case, the issue of approximation becomes quite more involved, which is a 
well documented phenomenon [5, 8, 11, 17, 28, 32]. For normal approximation of i.d. distributions, 



several important issues unique to the multivariate case are identified and addressed in [11]. The 
same issues also arise in the type of approximation considered here and actually become more 
serious. To address them, we consider a "radial" cumulant matching approach. Its idea is to 
apply the same cumulant matching method for the univariate case to each radial direction in 
the Levy-Khintchine representation, in such a way that, when the approximating Levy measures 
and Gaussian measures along different radial directions are "bundled" together, we get a valid 
multivariate i.d. distribution with desired order of cumulant matching and with the covariance of 
its Gaussian component being precisely evaluated. Although the approach does not completely 
resolve the aforementioned issues, it seems to work well in many important cases. 

As in the univariate case, we shall prove a similar type of bound for the error of the proposed 
PGN approximation in terms of total variation distance. On the other hand, the issue of compu- 
tational complexity needs to be considered more carefully. Recall the approximation is applied to 
X r in the decomposition X = X r + A r . As in the univariate case, the approximating random vari- 
able for X r has a compound Poisson component. Unfortunately, this component now can only be 
sampled by summing a large number of Poisson events. As a result, the computational complexity 
of the approximation of X r is much greater than normal approximation. However, one has to take 
into account the computational complexity of the sampling of A r . We will argue using an example 
that although the proposed PGN approximation as a whole has greater computational complexity 
than normal approximation, asymptotically, as r — > 0+, the two have the same order of complexity. 
Because the PGN approximation can yield substantially higher rate of convergence, therefore, at 
least asymptotically, it is worth the extra computation. Note that whereas in [11], the focus is the 
approximation of the related Levy processes, our discussion is restricted to i.d. distributions. An 
extension of PGN approximation to processes will be subject to future work. 

In Section 2, we shall set up notation and collect useful facts about i.d. distributions. Sections 
3 and 4 consider PGN approximation for univariate i.d. distributions and multivariate i.d. distri- 
butions, respectively. The proofs of the main results in these two sections are collected in Section 
5 and the proofs of related technical results are collected in Section 6. 

2 Preliminaries 

2.1 Notation 

Denote Z + = {0} U N and R + = [0,oo). If f(x) is a function on R , where x = (x±, . . . ,xa), 
then by f {a) {x) or d a f(x) we mean 1) a = (ai, ...,a d ) £ J.%, and 2) f^ a \x) = d" 1 ■ ■ ■ d* d f(x), 
where d^ denotes the fcth-order partial derivative with respect to Xj. The order of a is defined 

to be |a| = ai H + a d - Denote x a = x* 1 ■ ■ ■ x a A d and a\ = a\\ ■ ■ ■ a k \. Denote by y(R d ) 

the space of rapidly decreasing function on R rf . It is a basic fact that the Fourier transform 
h — > h(t) = J e 1 ^'^ h(x) dx is an homeomorphism of y(R d ) onto itself ([IS], p. 103). 

Denote by sppt(i/) the support of a measure v. For two random variables X and Y, their total 
variation distance [3] is denoted by 

d TV (X, Y) = sup{P{X £ A} - F{Y e A} : A measurable} 

and, if X, Y £ R, their Kolmogorov-Smirnov distance is denoted by 

d KS (X, Y) = sup{|P{A < x} - F{Y < x}\ : x £ R}. 

For any i.d. random variable X £ R , denote by ipx and ^x its characteristic function and 



characteristic exponent, respectively 

^ x (t) = e - 9x ^ =E[e i{t ' x) ], teR d . 
Let fx be the probability density of X. If it exists, then ipx = fx- Denote 

KaX = d a \nE[e {t > x) ] _ , |k| QiX = f\u a \X(du). 

The quantity n a: x is known as the ath cumulant of X. It is well defined provided that E[e'*'^'] < oo 
for all t in a neighborhood of 0. Some properties of cumulants can be found in [28]. We shall refer 
to |k| Q) x as the ath absolute cumulant of X. 

2.2 Basic assumptions and facts 

Let X E R d be i.d. with Levy measure A. We will always assume 

V x (t)= i\l + i(t,u)-e i{t ' u) )X(du), teR d , (1) 



in particular, X has no Gaussian component and EX = 0. The assumption causes no loss of 
generality since, if necessary, we can decompose X as X' + X" , such that X' has a Levy measure 
satisfying (1) and X" is a compound Poisson random variable. Then we can take X' — EX' as the 
new X. We will also always assume 



= oo. 

Under the assumption, X is not compound Poisson and P{X = x} = for x E K ([31], Theorem 
27.4). It is known that if d = 1 and X(R) < oo then X does not admit normal approximation [1]. 
The assumption excludes the case of lattice valued i.d. random variables, for which Poisson-Charlier 
approximation has been studied [3, 26]. 

Recall that for any a > 0, E||X|| a < oo if and only if Jl{||it|| > 1} ||-u|| a A(d«) < oo ([31], 
p. 159-160). If A is a Borel measure on R d with A({0}) = and J(||u|| 2 A 1) A(du) < oo, then A is 
the Levy measure of some i.d. random variable ([31], Theorem 8.1). Also, if sppt(A) is bounded, 
then E[e<*'*>] < oo for all t E R d ([31], Theorem 25.17). By differentiation, 

fo M = i 

' \ju a X(du) |q|>1 
for all a. It is easy to see that if each Oj is even or sppt(A) E M+, then n a ,x = \ K \a,x- Also, 

V(X) = J uu'A(du), 



and hence tr(V(X)) = f \\u\\ 2 A(dtt), where ti(A) denotes the trace of a square matrix A. 

Let / E C(R d ). For n E N, let U n be i.d. with W Un = n'^x- K sppt(/) C R d \ {0} 
and is compact, then nEf(U n ) — > J/dA, which directly follows from the vague convergence of 
nP{U n E dx} to A(dx) on {x : ||jc|| > e} given e > ([!], p. 39); see [21] for detail on vague 
convergence. The next result, which will be used later, concerns the case where sppt(/) is not a 
compact set in R \ {0}. When d = 1 and f(x) = \x\ p with p > 2, the result is established as 
Lemma 3.1 in [1]. However, as seen from the case X ~ N(0, 1), the asserted convergence in general 
is not true if f(x) = x 2 . 



Proposition 1. Suppose \f(x)\ < <7(||x||), where g S C(K+) is nondecreasing with g(t) = o(t 2 ) as 
t — > 0+. Suppose E||X|| 2 < oo and one of the following holds, 1) Ep(c||X'||) < oo for some c > 1, 
where X' = X\ — X%, with X\ i.i.d. ~ X, 2) E<7(2c||X||) < oo for some c > 1, or 3) provided X is 
symmetric, Kg(\\X\\) < oo. Then f G -^ 1 (A) and nEf(U n ) — > J f d\ as n — >• oo. 

3 Univariate Poisson-Gamma-Normal approximation 

3.1 Cumulant matching 

For simplicity and without loss of generality, we will only consider two cases 1) sppt(A) C M4. and 
2) X is symmetric. First, suppose sppt(A) C R+. Given r > 0, decompose X = X r + A r , where 
X r and A r are independent i.d. random variables such that 

V Xr (t)= f{l + itu-e itu )X r (du), with A r (du) = 1 {u < r} X(du) . (2) 

Given p > — 1, let 1^. be an i.d. random variable with 

/>oo 

*y r (t) = / (1 + itu - e iin ) 7r .(dn), with 7r (dn) = m(r)u p e - n/s(r) dn, (3) 



.70 

where 77i(r) > and s(r) > are constants that need to be determined. Finally, let 

T r = Y r + a(r)Z, Z ~ iV(0, 1) independent of Y r , (4) 

where <r(r) > is a constant that needs to be determined. 

We shall use T r + A r to approximate X, or equivalently, use T r to approximate X r . But first, let 
us point out how easy it is to sample T r . Clearly, the issue is the sampling of Y r . Since Y r = U — E.U, 
where U > is i.d. with Levy density m(r)l {u > 0} u p e~ u > s ( r > , and KU = T(p + 2)m(r)s(r) p+2 , 
we only need to consider the computational complexity of the sampling of U. If p = — 1, then 
U ~ Gamma(m(r), s(r)), the Gamma distribution with shape parameter m(r) and scale parameter 
s(r). It is known that the sampling of Gamma(a, b) has universally bounded complexity regardless 
of (a, b) ([14], p. 407-420). If p > -1, then U ~ E£=i&. where ^ ~ Poisson(a) with a = 
J °° m{r)u p e~ u < s ^ r > du = T(p + l)m(r)s(r) p+1 , and ^ are i.i.d. Gamma(p + 1, s(r)) random variables 
independent of N. The sampling of Poisson(a) is known to have universally bounded complexity 
([15] or [20], p. 228-241). On the other hand, conditional on N, U ~ Gamma(iV(p + l),s(r)). 
Therefore, the sampling of U, and hence that of T r , has the same order of complexity as the 
sampling of a normal random variable. 

Due to the Levy-Khintchine representation of T r , we refer to the approximation of X by T r + A r , 
or X r by T r , as Poisson-Gamma-Normal (PGN) approximation. 

It is easy to see EX r = ET r = KY r = 0, and for j > 2, 

K j,x r = f u ] A r (du), K j:Tr = Kj,Y r + 1 {j = 2} o-(r) 2 , 

(5) 
with Kj,Y r = T(j+p + l)m(r)s(ry+P +1 . 

This is the starting point of cumulant matching between X r and T r . In the next result, we allow 
r = 00, so it applies to any i.d. random variable with finite fourth cumulant. 

Proposition 2 (Fourth-order cumulant matching). Fix < r < 00. If r = 00, also assume 
K 4:,x r < °°- Then for all large p, 



P + 4 K2,X r Ki,X, 
p + 3 K 3 j 



< '^p^L . ( 6 ) 



For any p > — 1 satisfying (6), if 

S{r) ~ (p + 4)^ ' m[r) ~T(p + A)s(t)p^ ' l i} 

and ifY r is defined by (3), then K2 t x r > K 2,Y r , and by setting 

0-(r) = (K 2 ,X r -K2,Yr) 1/2 , (8) 

K j,x r = K j,T r for 2 < j < 4. 

Proof. By assumption, Kj,x r < oo for 2 < 2 < 4. By Holder inequality, k^ x < K2,x r ^i,x r -, which 
implies (6). Prom (5), by setting s(r) and m(r) as in (7), Kj t x r = K j,Y- f° r i = 3, 4 and 

w i q\ / \ / yp+3 w , Q^ K3 .*r ( p + ^ K 3,x r 
K 2 ,Y r = r(p + 3 )ro(r)s r) p+J = r(p + 3 = . 

r(p + 4)s(r) (p + 3)K 4 ,x r 

Then for p > — 1 satisfying (6), «2.y r < K 2,x r - The rest of the result is then clear. □ 

Proposition 3 (Fifth-order cumulant matching). Let X(du) = l{u > 0} u~ a ~ l £(u) du, where a E 
(0, 2) and £{u) is slowly varying at 0+. Let p = p{r) be defined by the equation 

1 , 1 _ K 3,X r K 5,X T 

P + 4 ~ 4,x r 

Then for all small r > 0, p > —1 and satisfies (6), and by setting s(r), m(r) and <r(r) according 
to (7) and (8), Kj,x r = ^j,T r for 2 < j < 5. 

Proof. Since £ is slowly varying at 0+, for j > 3, 

r ■ 1 r j ~ a £(r) 

KjXr = / u 3- l - a £{u)du~ , {) , (9) 

Jo J -a 

as r — > 0+ [4, 24]. As a result, 

1 i ( 4 ~ Q ) 2 _i = * r ^ + . 



p + 4" k 2 ^ (3_ G ) (5 _ a ) - ( 3_ G )( 5 _ a )' 

It follows that p ~ a 2 — 8a + 11 > — 1. Thus, for all small r > 0, p > — 1. By Proposition 2, it only 
remains to show that p satisfies (6) and K5,x r = ^5,Y r - By (9), asr-y 0+, 

K2,x r K4,x r (3 -a) 2 _ 1 



r-u 



K 2 3Xr (2-a)(4-o) a 2 -6a + 8' 

Therefore, with p > —1, (6) is equivalent to p > a 2 — Qa + 5, which holds for a € (0, 2). Finally, 
that K 5tXr = K 5)Yr follows from K^ >Xr ^,x r / n\ Xr = (P + 5 )/(P + 4 ) = K 3,y r K 5 ,y r /^ yr ,. □ 

Now we consider the symmetric case. Suppose X = X^' — X^ 2 \ where X^> are i.i.d. with 
Levy measure A supported in M + . Let X r = Xr — X r , and approximate it by T r = T r — T r , 
where Tr are i.i.d. defined in (4). Since all the odd-ordered cumulants of X r and T r are 0, we only 
need to match their even-ordered cumulants. The next results states that for the general case, we 
can match their cumulants up to order 7, and for the i.d. distribution as in Proposition 3, we can 
match their cumulants up to order 9. 



Proposition 4 (Symmetric case). 1) Fix r > 0. Then for all large p, 

(p + 5)(p + 6) ^ K 2 ,X T K6,X. 

,2 

v 4,x 
For any p > — 1 satisfying (10), if 



(p + 3)(p + 4p ^ 



lj V(p + 5)(p + 6)«4,*.' 2r(p + 5) S (r)P+ 5 ' l j 

and ifY r = Yr — Y r - , where Y r - are i.i.d. as defined in (3), then n 2 x r > K 2,Yr> an d by setting 
a{r) as in (8), Kj )Xr = Kj,T r for 2 < j < 7. 

2) If the Levy measure A of X" is l{u > 0} u~ a ~ 1 £(u) du, where a G (0, 2) and £(u) is slowly 
varying at 0+, i/ien /or aZZ small r > 0, i/iere is a unique p = p(r) > satisfying (10) and 

(ff+7)(p + 8) _ ^4,Xr^8,Xr / 12 x 

{p + 5)(p + 6) n\ Xr • l J 

Consequently, for this p, by setting s(r) and m(r) according to (11) and a{r) according to (8), 
Kj,x T = Kj,t t for 2 < j < 9. 

Proof. 1) By Holder inequality, k\ x < K2,x r ^6,x r i so for all large p, (10) is satisfied. Since for 
even-valued j, kjy t — 2k. v (i) = 2T(j -\- p + l)m(r)s(r)- 7+p+1 , it is easy to see k^x t = ^iY r an d 
^6,x r = K6,Y r - O n the other hand, for all odd- valued j, Kj,x r = K j,Y r = 0- Finally, by similar 
argument for Proposition 2, K 2 ,Y r < K 2,x r , leading to Kj t x r = K j,T r for 2 < j < 7. 
2) Following the proof of Proposition 3, 

K4,x r n 8 ,x r (6 - a) 2 1 4 

n ~t n t = 1 + t w r := Ma), r — > + . 

«g iJCp (4-a)(8-a) (4-a)(8-a) {h 

Clearly, h(a) is strictly increasing on (0,2). On the other hand, 



(p + 5)(p + 6) \ p + 5J \ p + 6 

is strictly decreasing on (— l,oo), with g(0) > h(2) > h(a) > h(0) > 1 = g(oo). Therefore, there 
is a unique p > satisfying (12). We have to show that for this p = p(r), (10) is satisfied for all 
small r > 0. By continuity, it suffices to show that for p > 0, 

(p + 7)(p + 8) _ (6 -a) 2 _ (;P + 5)(;p + 6) (4 - a) 2 



(p + 5)(p + 6) (4-a)(8-a) (p + 3)(p + 4) (2-a)(6-a)' 

By calculation, the equality is equivalent to 2p 2 = 2p(a 2 — 12a + 21) + 13a 2 — 156a + 356, while the 
inequality is equivalent to 2p 2 > 2p(a 2 - 8a + 5) + 9a 2 - 72a + 84. Then, by p > and < a < 2, 
the equality indeed implies the inequality. The rest of the proof then follows the one for 1). □ 

Propositions 3 and 4 directly lead to the following result on the truncated stable case. Note 
that for the non-truncated case, simple exact sampling method is known [14]. Also, for a £ (0, 1), 
the truncated stable distribution can be sampled exactly [9]. 



Corollary 1. Let X(du) = cl{0 < u < ro}u _a_1 du, where c > 0, r$ G (0, oo), and a G (0,2). 

1) Suppose X has Levy measure A. If p = a 2 — 8a + 11, then p > — 1 and /or a/Z < r < ro, 
by setting s(r) and m(r) according to (7), K 2> x r > K 2.Y r > a^d °2/ setting a(r) according to (8), 
>%\x r = Kj, Tr for2<j<5. 

2) Suppose X = X^ l > — X^ 2 ' , with jW freing i.i.d. u>i£/i Levy measure A. 7/p is i/ie unique 
solution in (0,oo) to 

(p + 7)(p + 8) (6 -a) 2 



(p + 5)(p + 6) (4-o)(8-o)' 

i/ien for all < r < ro, oy setting s(r), m(r) according to (11), «2X r > K 2,Y r , and by setting o~(r) 
according to (8), ftj, x r = K j,T r for 2 < j < 9. 

3.2 Error bound for approximation 

We consider the error of approximation of X by A r + T r . Denote constants 

Ci = sin 1 = 0.841 

1 f p 
C 2 = inf — — / u v ~ x e~ u du = inf P{£„ < p}, f p ~ Gammafp, 1). 

P >o r(p) y P >o 

Note that C2 G (0, 1) because 

-4t / u p_1 e-" du ^- / u p " x du ~ pP -»• 1, p -»• 0+ 

!» Jo !» Vo 

and by Central Limit Theorem, P{£ p — p < 0} — )• 1/2 as p — )• 00. 

Observe that for s(r) defined in (7) or (11), s(r) < r/(p + 3). The main result of the section is 
the following. 

Theorem 1. Fix r G (0, 00). Let T r be defined by (3) - (4) for the asymmetric case, or by 
T} — Tr for the symmetric case, with T r i.i.d. defined by (3) - (4). Suppose s(r) < r/(p + 3) 
and o~(r) > 0. Let 

t 2 
L{t,r) = —mm{CfK 2 ,x im , o-(r) 2 }. 

For j > 1, define Qj(r) > such that 

O(r) 2 - r(j ' + 1/2) + K j+1/2 l^ t 2j e- 2L ^ dt 
QAr) -2{C 2 C 2 )^/ 2+K2 ^ klr 

Given q > 5, suppose Kj,x r = K j,T r f or 2 < j < q- Th 



< 11 



\K\q,Xr + \ K \q,Y r 



d T y(X,A r +T r ) < ' ,q ' r „/ 2 ,q ' lr [qQ g -i(r) + Q q (r) + Q g+1 (r)}. (13) 

Q lK 2,X r 

Remark. 

1. The bound is on aVv instead of the more commonly used dxs [lj 26]. However, we have not 
been able to derive a Berry-Esseen type of bound of the form C(\K\ qt x r + \ K \q,Y r )/ K 2 x > with 
C a universal constant only depending on q. It appears that some key ingredients for the 
proof of the Berry-Esseen bound for normal approximation are still missing for higher order 
approximations. Also, it is likely that the constants in the bounds are not optimal. 



2. The bound will be proved by combining Fourier analysis, the Lindeberg method, and a device 
in [1] (cf. the proof of Theorem 25.18 in [31]). Although a bound on d^s may be established 
solely based on Fourier analysis [10, 26], our proof seems to be more transparent and suitable 
for generalization to multivariate cases. 

In the bound for d^y(X, A r +T r ), Qj{r) look rather technical. We can use the following result 
to bound them. 

Proposition 5. For b G (0, 1) and q > 3, there is M = M(b, q) > 0, such that if 

lim sup ^- < b, lim inf ^ %Xr , - > M, (14) 

r-M)+ K 2 ,x r r->o+ r 2 ln(l/r) 

then for any 2 < j < q + 1 , 

Since the proof is short, we give it here. By (14), for all small r > 0, cr(r) 2 = K-2x r ~ K 2,Y r > 
(1 — 6)ft2,x r - Then, from the increasing monotonicity of K2,x r in r , there is a constant c = c(b) > 0, 
such that for t > 1/r, L(t,r) > ct 2 K2,x 1/t - Consequently, if M > (q + 2)/c, then by (14), for 
t > 1/r, L(t, r) > Mchit > (q + 2) hit, and hence for all 2 < j < q + 1, 

/ 0e~ 2L ^dt< / t 2 ^ +1 )- 2Mc dt = o(l), r^0 + . 

J 1/r Jl/r 

Since K2,x r = o(l) asr-y 0+, the proof is complete. 

Example 1. Let A(du) = cl {0 < u < r^} u~ a ^ 1 du, where c > 0, < r$ < oo, and a € (0, 2). By 
Corollary 1, given r S (0, ro), if p = a 2 — 8a + 11, and s(r), m(r) and a(r) are set according (7) - 
(8), then Kj,x r = K j,T r for 2 < j < g = 6. To apply (13), we need to get K 2y x r , \ K \e,x r = «6,x r , and 
|«|6,y r = K 6,y r - For j > 2, K j:Xr = cr j ~ a /(j - a). Since 

K4,x r (3 - a)r r 



s(r) 



K 2,Y r 



(p + 4)K 3 ,x r (p + 4)(4-o) (4-o)(5-o)' 

K3,Y r K3,x r c(4-a)(5-a)r 2 ~ a 



(p + 3)s(r) (p + 3)s(r) (3 - a)(a 2 - 8a + 14) ' 

then K 6i y r = (6 +p)s(r)K 5i y r = (6 + 2>)s( r ) K 5,x r = cA(a)r 6 ~ a , with 

a 2 -8a + 17 
A(a) " (4-a)(5-a)2- 

Therefore, by Theorem 1, 



d T v{X,A r + T r ) < 



(2 -a) 



:-! 



C 



2 



1 



6 



A(a) 



v 6Q 5 (r) + Q 6 (r) + Q 7 (r) 2a 
x 6! ' 



Since < K2,y t /k2,x t < 1 is a constant independent of r, and A satisfies Orey's condition 
liminf r _ ! .o+ K 2,x,/^ 2 ~ a > ([29]; also see [31], Proposition 28.3), the conditions in (14) are satis- 
fied no matter the value of M. Then by Proposition 5, d r ry(X, A r + T r ) = 0(r 2a ). This may be 
compared to the normal approximation in [1, 26], where dxs between X and its normal approxi- 
mation is of rate 0(r a ' 2 ) when X is asymmetric. 

Furthermore, if X = X^ 1 ' — X^ 2 ' is symmetric, where X^' are i.i.d. with Levy measure A, then 
by similar argument while using 2) of Corollary 1, it can be seen that we can set q = 10 and get 
d r ry(X, A r + T r ) = 0(r 4a ), whereas the oks between X and its normal approximation in this case 
is of rate 0(r a ) [1]. □ 



Example 2. Let A(du) = 1 {u > 0} u~ a ~ x exp(— u b ) du, where a £ (0,2) and b > 0. If we 
directly evaluate L u J X(du) for j > 2, there is no closed formulas available. The following 
method avoids the problem. Recall that for any odd positive integer n, e~ u > f n (u) for u > 0, 
where f n (u) = Y27=o(~ u) % /i\. Let n > 1 be the smallest odd number greater than a/6 — 1 and 
F(u) = 1 {0 < u < r } f n (u b ), where r = sup{r > : f n (u) > for all < u < r b }. Decompose 
X(du) = X\(du) + \2(du), where Ai(du) = l{u > 0} u~ a ~ 1 F(u) du. Because u~ a ~ 1 [exp(—u b ) — 
F(u)] = 0{iv- n+l > b ~ a ~ l ) as u — > 0+, A2 has finite mass, and hence corresponds to a compound 
Poisson random variable that can be sampled exactly. Since Ai(dtt) = 1 {0 < u < vq} u~ a ~ 1 f n (u b ), 
for < r < r , it is easy to evaluate f£ u 3 \i(du). Then we can apply PGN approximation to Ai- 
If X, X', and X" denote i.d. random variables with Levy measures A, Ai, and A2, respectively, and 
A r and T r the i.d. random variables from the approximation, then by Proposition 3, we can get 
d TV {X, A r + T r + X") < Chv(X', A r + T r ) = 0(r 2a ). 

Example 3. Let X(du) = c\ {0 < u < 1} n" 1 ln(l/n) du. Since 

2(/J , f r W1/ N , cr 2 [21n(l/r) + l] 
u X(du) = c uln(l/u)du = —^ - 

u<r JO 4 

by Proposition 2.1 in [1], normal approximation works in the sense that its error in terms of dxs 
tends to as r — > 0+. However, since for \t\ S> 1, 

ti s c b 2 f 2,n s cC?ln|t| 
L{t,r) = ^— u 2 X{du) i— Li, 

1 Ju<l/\t\ l 

condition (31) holds only when c is large enough. Furthermore, even when (31) holds, the bound 
in (13) decreases to very slowly as r — > 0. □ 

4 Multivariate Poisson- Gamma-Normal approximation 

4.1 Radial cumulant matching 

In this section, we assume iGf* such that in polar coordinates its Levy measure is 

A(dn, d0) = X{du | 9) v{d6), 9 e 5, u > 0, 

where S = {9 £ M. d : \\9\\ = 1}, v is a finite measure on S, and for each 9 £ 5, A(du | 0) is a Levy 
measure on (0,oo). For symmetric X, A(du|0) = A(du| — 9) and j/(d0) = u{—d9). Without loss 
of generality, assume 

sppt(A) is bounded and not contained in a linear space of lower dimension. 

In particular, the assumption implies E||X|| P < 00 for all p > 0. 

The so-called radial cumulant matching is as follows. For each 9 £ S, find cr(9) > and a Levy 
measure j(du \ 9) on (0, 00), such that, first, for some q > 2, 

I u j X{du \9) = l{j = 2} a{9) 2 + I ui~i(du \ 9) i/-a.e. 9 

for all 2 < j < q if X is asymmetric, or for all even valued j > 2 less than q if X is symmetric, and 
second, for 9 ^ 9', if X(du \ 9) = X{du \ 9'), then a(9) = a(9') and 7 (dn | 0) = j(du \ 9'). Let T be 
an i.d. random variable with 



*r(*) 



ia(0) 2 (t, 0) 2 + /(l + i(t, ff)« - e i(t ' e)u ) 7 (du I 0) 



2 



i/(d0). (15) 
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Then ET = 0. If X is asymmetric, then for any a with 1 < \a\ < q, 

K a .T= I 9 a l{\a\ = 2}a(9) 2 + f u^j(du \ 9) v{d9) = /Vu |o| A(du \ 9)u(d9), 

which is just n a ,x- If X is symmetric, the equality holds for any a with \a\ being even and 
1 < |a| < q. On the other hand, if \a\ is odd, then K at x = and from the construction of 7, 
K a ,T = 0. Therefore, X and T have the same cumulants up to order q — 1. 

The Gaussian component of T has covariance f 99'a(9) 2 u(d9), which can be difficult to evaluate. 
For normal approximation, the issue can be circumvented by using the asymptotic of the covariance 
[11]. However, this approach rules out higher order approximation. We propose the following 
solution. Since A(du | 9) is a Levy measure on (0, 00) for each 9 E S, given r > 0, it is possible to 
select r = r(9) > and then set p = p(9), m(r) = m(r,9), and s(r) = s(r,9) as in Propositions 2, 
3, or 4, such that, letting j(du \ 9) = m(r)l {u > 0} u p e~ u / s ^ du, 

a(9f = I 1 {u < r} u 2 X(du \9) - I u 2 -f{du \ 9) = r 2 > 0, z/-a.e. 9 £ S. 

With this choice of ^(du \ 9), the Gaussian component is ^(O^ 2 ^), where 

K v = I 99'v{d6) (16) 



can be much more manageable. This is the same matrix identified in formula (3.17) of [11]. By 
the assumption on A, K v is positive definite (p.d.). 

There is some flexibility in choosing v. Given w(9) measurable on S with < essinf-u; < 
ess sup w < 00 under v, A(dn, d9) can be written as A(d-u | 9)i>(d9), where 

A(d-u I 9) = w(9)X(du \ 9), 9(d9) = u(d9)/w(9). 

If r, p, m(r) and s(r) are set according to A(dn| 9) instead of A(dn| 9), then the matrix in (16) 
becomes J 99'v(d9). This allows one to choose w(9) to simplify the evaluation of the matrix. 

In this setting, r instead of r is the parameter, and r, p, m(r), and s(r) are functions of (t,9). 
We denote the functions by r T (9), p T {9), m T {9) and s T (9), respectively. Evidently, 

//■oo 
1 {u < r T {9)} u 2 \{du \9)- m T (9)u p ^ 0)+2 e- u/s ^ 9) du = r 2 u-a.e. 9 £ S. (17) 

Additionally, 

\{du\9) = \(du\9') => f(9) = f(9'), for / = r T ,p T ,m T ,s T for all r > 0. 
Now define Levy measures 



A T (dn, d$) = \ T {du I 9)u{d0), with \ T {du \9) = l{u< r T {9)} \{du \ 9)u(d9), 
7r (dn, d0) = 7 T (dn | 9)u(d9), with 7r (dn | 9) = m T (9)l {u > 0} n ^W e - u/ ^ (e) du. 



(18) 



Then, for suitable q > 5, which depends on how r T , p T , m T , and s T are constructed, 

I u j \ T {du I 9) = 1 {j = 2} t 2 + I u j ~i T {du I 9), v-&.s. 9eS (19) 
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for all 2 < j < q if X is asymmetric, or for all even valued j > 2 less than q if X is symmetric. 
Next, decompose X as the sum of independent i.d. random variables X T and A r , with 



*x T (t) = I (1 + i«(t, #> - e ltt <*- e >) A r (d«, d0), 

and ^A T (i) = *x(i) — x I / x T (i)- Then approximate X r by 

T r = Y T + rZ, with Z ~ iV(0, A'„) independent of Y,., 

where 1^- is i.d. with mean and no Gaussian component, and with Levy measure y T . Finally, X 
is approximated by A r + T T . 

Clearly, in order for the solution to be valid, A r and 7 T have to be valid Levy measures. First, 
this means 1 {u < r T (9)} and m T (9)l {u > 0} u Pt ^ 'e~ u ' Sr ^ ' must be measurable functions of (u, 9). 
In many cases, the measurability is not difficult to verify. Provided it is established, A T immediately 
is a valid Levy measure. On the other hand, since J u 2 ~j T (du,&6) < J u 2 X(du,d9) < oo, by the 
comments in Section 2.2, 7 T is a valid Levy measure and E||l^-|| 2 < oo. 

The solution shifts the burden of evaluating the normal covariance to the sampling of A r and 
Y T . For the latter, the following statements are true. 

Proposition 6. 1) If 

I B T (6)u(dO) < oo withB T {9)= I l{u> r T (6)} \(du\6), (20) 

then A T ~ Ci&l + • • • + Cn^n — Aj where {loi} is a Poisson process on S with Levy measure B T dv, 
conditional on {uii}, Cl, ■ ■ ■ , Gv are independent, with Q ~ 1 {it > r T (oJi)} X(du \ uji) / B T (Cji) , and 

fx= I 9ul{u> r T {9)} \(du | 9) i/(d0). 

2) Ifp T {9) > -1 for u-a.e. 9 and 

I N T {9)v(d9) < oo with N T (9) = T(p T (9) + l)m T (9)s T (9) p ^ + \ (21) 

Js 

then Y T ~ Ci^i + • • • + Cn^n — /^ where {uji} is a Poisson process on S with Levy measure N T dv, 
conditional on {uj{\, C,\, . . . , Qn are independent, with Q ~ Gamma(p T (wj), s T (iOi)), and 



M= / 9(p T (9) + l)s T (9)N T (9)u(d9). 

3) If X is symmetric, then fi = fx = 0. 

4) If X is direction independent, i.e, X(du \ 9) = Ao(dn) for v-a.e. 9 G S for some Levy measure 
Ao, then given r > 0, r T , m T , p T , s T , and N T are u-a.e. constant, and fi = 9 V j ul {u > r T } Xo(du), 
fJ- = 6u(Pt + l)sr-A r T , where 9 V = J s 9u(d9). 

Proof. 1) It is easy to see A r has mean and no Gaussian component, and its Levy measure is 
1 {u > r T (9)} X(du\9)v(d9), which by assumption (20) has a finite mass. Then 1) follows from 
standard results on compound Poisson processes [12]. The proof of 2) is similar. Finally, both 3) 
and 4) follow from the construction of the functions r T (9), p T (9), s T (9), and m T {9). □ 
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To apply the result, it is desirable that 

esssup-B r (#) < oo, esssup-/V r (0) < oo under v, 

because {u)j} and {uii} then can be sampled using the standard thinning method ([14], p. 253-255), 
as long as it is easy to sample the Poisson process on 5 with Levy measure Cu for any C > 0. In 
this case, there is no need to know all B T {9) and N T {9) beforehand. Parts 3) and 4) of Proposition 
6 lists two cases where \x and [i are calculable. It follows that the quantities are still calculable if 
A is the sum of a symmetric Levy measure and a Levy measure A' that is "piecewise" direction 
independent, i.e., A'(du| 9) = Aj(du) for i/-a.s. 9 G Si, i = 1, . . . ,n, where Si form a partition of 
S. However, in other cases, p, and p may not have closed form expressions. This is the limitation 
alluded in Introduction. 

Example 4. Let A(dn, d0) = 1 {0 < u < r } u'^ 9 ^ 1 du v(d9), where r G (0, oo) and a(9) G (0, 2) 
is a measurable function on 5, such that 

< ao := essinf a < ess sup a = a\ < 2 under v. (22) 

Suppose also that A is symmetric. For a G (0,2), let 7r(o) be the unique solution in (0,oo) to 

(p + 7)(p + 8) _ (6 - a) 2 
(p + 5)(p + 6) ~ (4-o)(8-o)' 

Since A(du | 9) = 1 {0 < u < r } n -0 ^ -1 du, in light of Corollary 1, define r T (9), p T {9), s T (9), and 
m T (9) as follows. First, let p T (6) = n(a(9)). Next, provided r T {9) G (0, ro), define 

s T (9) = J 1 (a(9))r T {9), rn T (9) = J 2 (a(9))r T (9)- 1 ' a ^-«^ 
according to (11), where 



Ji{a) = J , , s ■ J* °1 -7- r, J 2 (a)- 



(7r(a) + 5)(7r(o) + 6)(6-o)' v ' 2r(vr(a) + 5)(4 - a)J 1 {aY^)+^ 

are strictly positive and continuous on (0,2). This yields f u 2 X T (du\9) = Jo(a(9))r T (9) 2 ~ a ( e ' , 
where Jo (a) = 1/(2 — a), and 

u 2 lT (du | 9) = r(Tr(a{9)) + 3)m T (0)s(r)"( a W)+ 3 = J 3 (a(9))r T (9) 2 - a(e \ 

where J3 G C(0, 2) is strictly positive. In particular, < J^{a) < Jo (a). Finally, from condition 
(17), it follows that if r T (9) G (0,r ), then r T (9) = J 4 (a{9))T 2 ^ 2 - a ^\ where 

J4(a) = [Jo(a) - J 3 (a)]- 1 /( 2 " a ) G C(0,2). 

By assumption (22), for all small r > 0, ess sup J4(a(#))r 2 '( 2_a ' >' < ro, and hence all the above 
definitions are valid. Since 7r(a) and all Jj(a) are continuous functions, ir(a(0)) and Ji(a(9)) are 
measurable functions of 9. It is then easy to see 1 {n < r T (9)} is a measurable function of (u, 9) 
and hence A r is a valid Levy measure. Likewise, -y T is a valid Levy measure. Consequently, by 
Corollary 1 and the symmetry of A, (19) is satisfied with q = 10. 
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We consider the sampling of A T and Y T based on Proposition 6. Given r > small enough, by 
r T (6) = J 4 (a(#))T 2 /( 2 - a W) and (22), 



ess sup B{9) = ess sup / u~ a ^ '~ x du < ess sup — — — — < oo. 
Jr T <e) a (#) 



Since A is symmetric, then by Proposition 6, A r = CiWi+- • -+(n&n, where {Cj{\ is a Poisson process 
that can be sampled using the thinning method, while conditioning on a)j, the probability density 
of Q is proportional to l{r T (uii) <u< ro}u~ a ^ i ' 1 . On the other hand, by the construction 
of p T , m T and s T , N T (9) = J (a(9))r T (9)~ a ^ e > for some continuous J{a) > 0, so again by (22), 
ess sup N(9) < oo. Then Y T = (±uji + • • • + Cn<^n, where {cjj} is a Poisson process that can be 
sampled using the thinning method, while conditioning on Wj, Q ~ Gamma(?r(o(wj)), s t (wj)). 

Unfortunately, unlike the univariate case, currently there are no computationally efficient meth- 
ods to sample A T or Y T , other than sampling individual (tDj, Q) or (wj, Q) and then taking the sum 
of Quii or QiUJi. This raises the issue of computational complexity of the PGN approximation. In the 
next subsection, after obtaining an error bound for the PGN approximation, we will come back to 
the issue and argue that, in some cases, comparing to normal approximation [11], the improvement 
in error rate is worth the extra computation complexity, at least asymptotically. 

Finally, we remark that if v(d9) = w{9) a(d9), where a is the spherical measure on S and 
w(9) is measurable on S with < essinfu> < ess sup w < oo under a, then by setting r T (9) = 
J 4 (a(9))[T 2 /w(0)] 1 K 2 - a( M and adjusting s T (9) and m T {9) accordingly, we get K v = J99'a{d9) = 
I/d. With a(d9) being the new v(d9), the sampling of A r and Y T can be done as above. □ 

Example 5. Normal approximation of tempered stable processes is studied in detail in [11]. For 
such a process, X(du\9) = 1 {u > 0}u~ a ~ 1 h(u,9)du, where for each 9, h(u,9) is a completely 
monotone function with h(0+,9) = 1 and h(oo,9) = 0. A generalized shot noise series represen- 
tation is used in [11] for normal approximation. Although the resulting covariance of the normal 
distribution in general has no closed form, it is shown that by using its asymptotic, the normal 
distribution still works. 

In the context of PGN approximation to the i.d. distribution with Levy measure A, we have to 
evaluate the covariance precisely. At this point, a solution to the general case has not been found. 
However, suppose h(u, 9) can be written as 

h(u, 0) = 1 - hi(9)u + h 2 (u, 9)u 2 , 

such that < essinf h\ < ess sup h\ < oo under v and < essinf hi < ess sup hi < oo under t x u, 
where £ is the Lebesgue measure, then the following method can be used. First, fix r$ such that 
1 - hi{9)r > for u-a.e. 9. Let 

fi(du | 9) = 1 {0 < u < r } u~ a ~ l [l - hi{9)u\ du 

and n(du,d9) = fi(du\ 9)v(d9). Decompose A = fj, + Ai, with Ai = A — yu. For < u < tq, 
Ai(du|0) = ir - 1 ^^) - 1 + h 1 (9)u]duu(d9) = u~ a+1 h 2 (9,u)duu(d9), while for u > r , 
Ai(dn | 9) = \{du \ 9). As a result, Ai is a Levy measure with 

I Ai(d«, d9) = I v{d9) I ° u~ a+1 h 2 {9, u)du+ 1 1 {u > r } A(d«, d0) < oo 

and hence it gives rise to a compound Poisson random variable. We therefore only need to apply 
PGN approximation to [i. It is easy to compute J u 3 [i{du\ 9) for j > 2. Then the functions r T (9), 
p T (9), s T (9), and m T {9) can be fixed following Example 4, although the calculation is more tedious 
due to the extra term h\(9)u. □ 
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4.2 Error bound for approximation 

We next consider how well X is approximated by A r + T T . For any symmetric p.d. matrix X, 
denote by X 1 ' 2 the unique symmetric p.d. matrix whose square is equal to E. Given r > 0, let 

S r = Y{X T ) = V(T r ), A T = sy 2 , K* = t 2 A- x K v A- x . 

Let C\ and C2 be the same constants defined in Section 3.2. Denote, for z > 0, 

g T (z) = smallest eigenvalue of CiA~ 1 M T (z)A~ 1 and K* 



with M T (z) = I u 2 ee'l\u< r T (6) A — J- 



A T (du|0)z;(d#). 



(23) 



Finally, denote 

h{d) = [d/2\ + 1. 
Theorem 2. Given r > 0, suppose that under v, 

essinfp T (0) > -1, es S sup [PT(g) + 3]Sr(g) <1 (24) 

i?* :=esssup||A7 1 0||r T (0) < 1, S* := ess sup \p T (6) + h(d) + 1}\\ A' 1 6\\s T (9) < 1 (25) 
and for some q > 5, K a) x T = i^a,T T for 2 < \a\ < q and 

fOD 

/ s 2 q+ 2h(d)+d-l e - eT (l/s)s^ ds<00 _ (26) 

Jo 
Then 

G(d,q,r) 



d TV (X,A T + T T ) < 



where \\x\\\ stands for the L 1 norm \xi\ + • • • + \xd\ an 



I WuA^eWlXridu, d9) + f \\uA- l e\\ q llr {du, dO) 



(27) 



G(d,q,r)=c(d,q)Jl+ T L d , q (s)e-^W s ) s2 ds 

y Ji/R* 

with c(d,q) being a constant only depending on (d,q) and Ld, q {s) a polynomial of order no greater 
than 2q + 2h(d) + d — 1 whose coefficients are constants only depending on (d,q). 

Remark. 

1. Although A T appears in the bound, it is not used in the actual construction of T T or A T , and 
therefore does not generate a computational problem. 

2. A drawback of the bound in Theorem 2 is that, although asymptotically, the error rate can be 
significantly better than normal approximation, the constant coefficients in the bound, i.e., 
c(d,q) and those in Ld„(s), are very large even for modest d. Perhaps alternative methods 
for normal approximation (e.g. [3, 8, 28]) could be employed to improve these terms, or even 
replace G(d,q,r) with a universal constant that only depends on (d, q). 

In Theorem 2, the inequalities in (24) are the easiest to establish. On the other hand, R* and 
S* need more careful treatment as they involve A r . By (24), S* may be bounded via R*. The 
main technical term in Theorem 2 is G(d,q,r). The next result, which will be proved in Section 
5.2, provides some simple criteria to bound R* and G(d,q,r). 
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Proposition 7. Under v, the following statements are true. 

1) Let c = c(K u ) > be a square root of the smallest eigenvalue of K u . Then sup 5 H-A" 1 ^ < 
l/(cr). // 

lim ess sup r T (9)/r = 0, (28) 

then R* = o(l) as r — )• 0. 

2) Given b > 1 and a > 5, iftere is M = M(b, q, K u ) > 0, such that, if, in addition to (28), 

lim sup — ^ ess sup / u 2 X T (du\9) < b, (29) 

r^0+ T^ J 

liminf-= — — -essinf / u 2 l {u < r} X(du I 9) > M, (30) 

r->0+ r 2 ln(l/r) 7 

i/iere (26) holds and G(d,q,r) = c(d,q) + o(l) as r — >• 0. (Note the variate in (30) is r, not t.) 

Example 5 (Continued). Note the assumption in (22). By r T (0) = J 4 (a(6 , ))r 2 /( 2 " a ( e )), (28) 
is satisfied. By fu 2 X T (du\9) = J (a(9))r T (9) 2 - a ^ = J{a{9))r 2 for some J(a) e C(0,2), (29) 
is satisfied. Since Ju 2 l{u < r} X(du\9) = r 2 ~ a ( e >/(2 — a(9)), then (30) is satisfied no matter 
the value of M. Thus we can apply Proposition 7. Since p T (9) = ir(a(9)) > and (p T {9) + 
3)s T (9)/r T (9) = (p T (9) + 3)Ji(a) < 1, the conditions in (24) are satisfied. The last condition 
we need to check that for small r > 0, S* < 1 in (25). However, by (24) and esssupp T (#) = 
esssup7r(a(0)) < oo, S* = 0(R*) = o(l). 

We now can apply Theorem 2. By ||x||i < V"||x|| for x G M. d and by ll^l" 1 ^!! < l/(cr), where 
c is the constant in Proposition 7, 



\uA~ 1 9\\ q 1 X T (du,d9) < d q ' 2 t u q \\A; 1 9\\ q 2 X T {du,d9) 
<d q l 2 {cT)~ q I u q X T (du,d0) 



= (cT/Vd)- q / ^^M U(d9) < ^(9-2)00/(2-00) 

J q~ a{9) 

where d is a constant independent of r. For J ||tij4~ 1 0||^7 r (dt(, d0), a similar bound holds. Com- 
bining these bounds and Proposition 7, d^y(X, A T + T T ) = 0(T^ q ~ 2 ^ a °^ 2 ~ a °' > ), where q = 10. 

Finally, we compare the computational complexity of the above PGN approximation and the 
normal approximation for X [11]. To make a reasonable comparison, assume A is direction in- 
dependent, so that X(du,d9) = u~ a ~ 1 v(d9), where a £ (0,2) is a constant. Then given r, both 
approximations use r T = J^a)^'^ 2 " "' as the cut-off value for jump size and sample A T , which 
involves N\ ~ Poisson(a~ 1 (r~ a — r ( 7 a )i/(S')) events. However, the PGN approximation also samples 
Y T , which involves another N2 ~ Poisson( J(a)r~ a is(S)) events. As r — > 0, N2 = O p (Ni), and hence 
the approximations have the same order of complexity. On the other hand, by Theorem 2, the div 
between X and A T + T T is 0(r 8a '( 2-a '), whereas the dry between X and its normal approximation 
is 0(r 2a '( 2 ~ a '). Therefore, at least asymptotically, the PGN approximation has higher but the 
same order of computational complexity as the normal approximation, and the extra complexity 
may lead to significant improvement in error rate when a is not too small, e.g., a > 1/8. □ 
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5 Proofs of main results 

5.1 Univariate case 

To prove Theorem 1, we can assume that 

i 

i 2( g+ l) e -2L(t,r-) di<00 _ (31) 



/0 

Otherwise, Q q +i = oo and the result is trivial. We need the following two lemmas. 

Lemma 1. 1) Let £ be i.d. with ^(i) = J(l + itu — e ltu ) v(du) and E|£| J ' < oo for all j > 1. Given 
e > 0, let Z ~ iV(0,e 2 ) be independent of £. Then ^ +z G ^(R). 

2) Under condition (31), f Xr G C q (R), and for < j < q, /^(x) ->• as |x| -> oo. 

The second lemma is as follows. Note that it does not require matching of cumulants. 

Lemma 2. Let T r be defined as in Theorem 1, such that s(r) < l/(p + 3) and o~(r) > 0. Fix e > 0. 
Given A, B > with A + B = 1, let W be an i.d. random variable with 

* w (t) = A* Xr (t) + B* Tr (t) + e 2 t 2 /2. 

Let £ = W/V, w/iere z/ = ■ s /AK 2t x r + ^ K 2,r r • 77ierc /^ G ^(R) and /or j > 1, 

|if(x)l dx < j/j_i(r) + /,(r) + (1 + e 2 /^)/i+i(r), 

where for j > 0, Jj(r) > suc/i i/iai 



/; .( r )2 = ^+1 



2jD (r)2j+i J 1/r 



D{rf = ACWx r + B(CfC 2 K 2 ,Yr + ^(0 2 ), 
AC?* 2 /• .. 2wj .., , Sa(r) 2 t 2 

'«<i/|t| 



H(t,r) = ^ f u 2 X(du) + 

1 Ju<l/\t\ 



To prove Theorem 1, by dxv(^> A r + T r ) = dTv(A r + X r , A r +T r ) < d^y{X r ,T r ), it suffices 
to show (13) for dTy(X r ,T r ). Let Z and Z' be i.i.d. iV(0, 1) random variables independent of X r 
and T r . Fix e > 0. Letting h be a measurable function with ||/i||oo < L our first goal is to bound 

A e = E[h(X r + eZ) - h(T r + eZ')]. 

For n > 2, we have representations 

X r + eZ = U 2 + ■ ■ ■ + U n+1 , T r + eZ' = V 1 + --- + V n , 

(note the index of U starts at 2), where Ui and Vj, i, j = 1, . . . ,n + 1, are independent i.d. random 
variables with 

V Ut (t) = n- l ^ Xr +ez{t), V Vi (t) = n-^Tr+eAt). 
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For fc = 1, ... ,n + 1, let 



l<j<fc k<j<n+l 



and 5 fc (x) = E/i(W fc + x). By X r + eZ = W\ and T r + eZ' = W n+ i, it is clear that 

|A 6 | = | 3l (0) - 5 „ +1 (0)| 

< |E[5i(C^i) - 5n+1 (K+i)]| + |E[ ff i(C/i) - <7i(0)]| + |E[ 5n+1 (K+i) - 5n+i(0)]|. (32) 

We bound the expectations on the last line separately. By W k + V k = W k+ \ + U k+ \, 

n+l 

h(Wi + C/i) - h(W n+1 + V n+1 ) = J>(W* + U k ) - h(W k + V k )}. 

fc=l 

By independence, ~Eh(W k + U k ) = ~Eg k (U k ) and Eh(W k + 14) = ~Eg k (V k ). Therefore, taking expec- 
tation on both sides of the displayed identity yields 

n+l 

HgiiUi) - 9n+i(V n+ i)} = ^2n gk (U k ) - g k (V k )}. (33) 

fc=i 

1/2 

Denote v = k 2 x . Let £& = W k /v. By Lemma 1, /^ G ^(M). As a result, 



gk(x) = / h(uu + x)f^ k (u)du = / h(uu)f^ k (u - x/v) du 



is smooth. By Taylor expansion around 0, 

s fc (£4) - ttWk) = E 9j ^rK ~ v i) + ^\9 h q \0{U k )U k )Ul - g { k q \e(V k )V k )V k % 
j=l J - q - 

where 9{x) G [0,1]. By assumption, Kj,x r = Kj,T r for 1 < j < q. Since Kj,u k = n~ 1 Kj : x r +eZ = 
n _1 («j,x r + e 2 l{i = 2}), and likewise Kjy k = n~ l (Kj yTr + e 2 l{j = 2}), then Kjjj h = K j,v k for 
1 < i < 9- As a result, ££/? = EV/? for 1 < j < q and hence 



1 



Sl)(Q(TT.\ir.\TT<l J-l)( 



n 9 k(U k )-g k (V k )] --= -K[gf\e(U k )V k )Ul-g^{e{V k )V k )V«] 
giving 



?! 



|%*(£4)-<7fc(^)]| < ^^[El^r + El^l"]. (34) 



Since g k \x) = (—u) q f h(vu) ff' (u — x/v) du, then 



bailee <^||4 9) (u)|du. (35) 



Because 

^ Wk {t) = {k- l)V Vl (t) + (n + 1 - fc)*^*) 

n + 1 - /c . . fc - 1 T , . e 2 t 2 

-^x r (t) + ^^^T r (t) + — , 



?? 
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we can apply Lemma 2 with v 2 = K2,x r = K 2,T r , A = (n + 1 — k)/n and B = (k — l)/n therein. By 
definition of D{r) and H(t, r) in Lemma 2, 

D{r) 2 = AC 2 v 2 + B{C 2 C 2 k %Yt + a{r) 2 ) > AC\v 2 + 5C 2 C 2 ( K2 ,y r + a(r) 2 ) > C\C 2 v 2 

and 

AC\t 2 f 2 5a(r) 2 i 2 



H(t,r) = ^L f u 2 X(du) + 

1 Ju<\l\t\ 



u<l/\t\ 2 

> i^±^! min <| c 2 i u 2 A(du), <x(r) 2 [> = L{t 



min J Cf f 

[ Ju<l/\t\ 



By definition of Qj(r) in Theorem 1 and definition of Ij(r) in Lemma 2, Ij(r) 2 < Qj(r) 2 . By 
condition (31), Qj(r) 2 < oo for < j < q + 1. Thus (35) and Lemma 2 give 

Halloo < «/~ 9 [<?Q,-i(r) + Q q (r) + (1 + e 2 /z/ 2 )Q g+1 (r)] := M e < oo. 
Since M e is independent of k, by (33) and (34), 

n+l n+l 

\Eg 1 (U 1 ) - Eg n+1 (V n+1 )\ < ^ \E[g k (U k ) - g k (V k )}\ < -± ^(E\U k \ q + W)- 

fc=l 1' k=l 

Since the Levy measure of X r has bounded support, E\X r + eZ\ q < oo. Meanwhile, from (5), 
E\Y r + eZ\ q < oo. Then by Lemma 3.1 in [1], 

ra+l n+l 

^2K\U k \ q -+ \K\ q ,X r +eZ = \n\ q ,X r , ^E|Vfc| 9 -+ \n\ q , Tr+€Z > = \K\ qj Y T - 
k=l fc=l 

As a result, 

M t 
limsup \Egi(Ui) - Eg n+1 (V n+ i)\ < —r(\K\ qt x r + N 9 ,yJ- (36) 

On the other hand, |E[ 5l (C/i) - ffl (0)]| < || ff 5_|| 00 lE|C/' 1 |. Since EUf = Y(U X ) = Y(X r + eZ)/n, 
by Cauchy-Schwartz inequality, as n — > oo, E\U±\ — > 0. As in (35), ||<?i||oo < v~ x / |/l(w)| du. By 
/cj € o5^(K), H^ Hoc < oo. Since g\{x) = Eh(X r + eZ + x) is a function independent of n, it follows 
that E[gi(Ui) — <7i(0)] — > as n — > oo. Likewise, E[g n+ i(V n+ i) — g n+ i(0)] — > 0. Together with (32) 
and (36), this implies 

\Eh(X r + eZ) - Eh(T r + eZ')\ < ^- (|«|, iXr + |«| ff ,y r ). 

Let G C K. be the union of a finite number of (at, b{) and h(x) = l{x £ G}. By 2) of Lemma 
1, P{X r = en or &j, some i} = 0. Then /i(X r + eZ) — h(X r ) — > a.s. as e — > 0+. On the 
other hand, since T r is the sum of Y r and an independent nonzero Gaussian random variable, 
by 1) of Lemma 1, f Tr G ^(R). As a result, h(T r + eZ') - h(T r ) -+ a.s. as e ->• 0+. Also, 
M e — )• M := v~ q [qQ q -i{r) + Q q (r) + Q q +i(r)]. Thus, by dominated convergence, 

M 
\F{X r e G} - F{T r e G}\ < -r(\n\ q ,x r + |«|,,y r )- 
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Let A C R be measurable. Given 5 > 0, there is R > 0, such that, letting B = An (—R,R), 
F{X r e A\B} + F{T r G A\B} < 5. Then there is an open GD B with £(G \B)<6, where £ is 
the Lebesgue measure. G is the union of at most countably many disjoint open intervals (aj,6j). 
Let G k = Uf =1 (a h bi). Then 

\¥{X r ei}- F{T r G A} | 

< \F{X r G G fc } - P{T r G G fc }| + P{X r G G \ G k } + P{T r G G \ G k } 

+ F{X r €A\G}+ F{T r eA\G} + F{X r G G \ A} + F{T r eG\i) 

M 

< -r(Kx r + *\ q ,Yr) + nx r € G \ G k } + F{T r G G \ G k } 

+ F{X r eA\B}+ F{T r eA\B} + (\\fx r \\oo + ||/tJoc) i(G \ B) 

M 

< — (\n\ q , Xr + \K\ q , Yr ) + F{X r €G\G k }+ F{T r G G \ G k } 

+ (l + \\fx r \\oo + \\fT r \\oo)S. 

By Lemma 1, H/xJIoo + H/tJIoo < oo. Then, letting k — > oo followed by 5 — > yields 

\F{X r G A} - F{T r GA}]\< ^{\K\ q , Xr + \n\ q , Yr ). 
This completes the proof of Theorem 1. 

5.2 Multivariate case 

We shall prove Theorem 2 by standardizing the random variables involved. First, we have the 
following simple result. 

Lemma 3. Let X be i.d. with Levy measure A(du, d9) in polar coordinates and A be a nonsingular 
matrix. Then A~ 1 X has Levy measure \{J A (•)) and variance A~ 1 N(X)A~ 1 , where 

J A (u, 9) = (wa-^Wu, rp4?) > ( u < e ) G (°> °°) x s - 

Furthermore, JJ 1 ^^) = J A -i(u,9) = (\\A6\\u, A6/\\A6\\). 
Proof. Denote h(z) = 1 + z — e z . For any t G M rf , 

<H A -i x (t) = Vx(A-H) = f h(i(A- l t,9)u)\(du, d0) 

h(i(t,A- l 9)u)X(du, d6) = f h(i(t,oj)v)X(du, d9), 



where (v,co) = J A (u,9). Then the lemma easily follows. □ 

Recall A T = T^J 2 and K* = r 2 A- x K v A~ x , where S T = Y(X T ) = V(T r ). Let 

X* = A^Xr, T* = A~ l T T , Y* = A~ X Y T . 

Then X* and T* are standardized, i.e., EX* = ET* = and V(X*) = V(T*) = I. It is easy to 
check that if K a ,x T — K a,T T f° r 2 < |a| < q, then n a ^x* = K-a,T* for 2 < \a\ < q. By Lemma 3, 
X* and Y* have Levy measures A*(du, d9) = A r (dt>,dw) and r y*(du,d9) = r y T {dv,dui), respectively, 
where (v,co) = J^(u,9), and T* = Y* + Z*, where Z* ~ N(Q,K*) is independent of Y*. 
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Lemma 4. Under conditions (24) and (25), E||X*|| a < oo and E||Y*|| a < oo for any a > 0. 
The next lemma is analogous to Lemma 1. 

Lemma 5. Given r > 0, the following statements are true. 

1) Under condition (24) and (25), for any a > and b > with a + b > and e > 0, if £ is 
i.d. with <!>£ = a^ x * + b^y* and Z ~ N(0, e 2 I) is independent of £, i/ien -0^+z £ ^(M d ). 

£) Under condition (26), /x* G C q {Mr) and for each \a\ < g, f^{x) — > as |x| — )• oo. 

The next lemma is analogous to Lemma 2. 
Lemma 6. Given e > and ^4, -B > u>i£/i A + B = 1, let £ be an i.d. random variable with 

^(t) = A^x* (t) + B^ T * (t) + e 2 \\tf/2. 
Then, under condition (25), for m > 3, 



max / \&\x)\dx<{l + e 2 /d) h ^c{d,m)Jl + / L d>m (s) e -^( 1 /«)« 2 da, 

H=mJ ? y 7l/fl* 

where c(d,m) is a constant only depending on (d,m) and L^ m {s) a polynomial of order no greater 
than 1m + 2h(d) +d— 1 whose coefficients are constants only depending on (d,m). 

The proof of Theorem 2 is similar to the one for Theorem 1, so we will only give its sketch. By 
d TV {X,A T + T T ) < d TY (X T ,T T ) = d Ty {X*,T*), it suffices to show (27) for d Ty (X*,T*). Let Z 
and Z' G M. d be i.i.d. N(0,I) random variables independent of X* and T* . Given e > 0, for n > 2, 

X* + eZ = U 2 + • • • + C/ n+ i, T* + eZ' = V 1 + --- + V n , 

where U% and Vj are independent and i.d. with ^u i = n~ 1 ^ x*+eZ and VlJy. = n~ l ^>T*+eZ'- Let 
Wfc = Vi + • • • + Vk~i + Uk+i + • • • + U n+ \. Given a measurable function h with ||/i||oo < 1, let 
0b(x)=E/i(Wjfe + aj). Then 

|E[/ipT + eZ)-/i(Y* + eZ')]| 

n+l 
< J^ |E[5 fc (C/ fc ) - g k (V k )}\ + |E[5i(C7i) - 5l (0)]| + |E[<? n+1 (K+i) - <7„+i(0)]| 
fc=l 

Since (?fc(x) = J h(u)fw k (u — x) du and \\h\\oo < 1, by Lemma 5, #fc G C°°. Then 

9k(U k ) - g k (V k ) = Y, 9jL ^r-m-V k a )+ J2 ^[9 { k a \0(Uk)U k m - 9 ( k a \0(V k )V k )V k % 

\a\<q |o|=9 

where 9{x) G [0, 1]. Since EE/£ = EV fc a for 2 < \a\ < q, then 

\n 9 k(U k )-g k (V k )]\ < J2 ^^[E|^|+E|V; Q |]. 

\a\=q 
By Lemma 6, for \a\ = q, 



lbi a) ||oo< I '\f^ k (u)\du<(l + e 2 /d) h ^G(d, q ,r) 
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which is finite by (26). Note \u a \ < \\u\\ g . Then by Lemma 4 and Proposition 1, as n — > oo, 

n+l n+l 

fe=i fc=i 

From here, an argument similar to that for Theorem 1 leads to 

|«|a,X* + |«|a,r* 



d TY (X*,T*)<G(d,q,r) ]T 



a! 

\a\=q 



Since 



|a|=g 

E ^? / I K"^)1 A T (dn, M) = ±J || (uA-^)||f A r (du, d0) 



|a|=g |a|=g 



|a|=9 
and a similar expression holds for X^|a|=o l K la,T*/«!) the proof of Theorem 2 is then complete. 

Proof of Proposition 7. 1) By t 2 K u < E T , the smallest eigenvalue of S T is at least r 2 c(.Ki,) 2 , 
yielding the first assertion and R* = O(esssupr r (0)/r). Then by (28), R* — > as r — > 0+. 
2) By condition (29), for small r > 0, S T = f 99' u(d9) fu 2 X T {du | 9) < br 2 K v and hence 

t 2 K u > ciE T , (37) 

where ci = 1/6 > 0. Denote 



g T (s,9) = [u 2 l\u<r T (e)A „ * „ ) 
J { \\A T V\\s ) 



A(du|0). 



Then M T (l/s) = J 66'g T (s, 9) u{d9). If l/R* < s < ...A. .„. , then by definition of r T {9), 

\\A. T p||7V(yJ 



g T (s,9) = I u 2 l{u<r T (9)} X(du\9) = t 2 . 



If s > .-iL — 77T) then by HA,. 1 ^! < l/(cr) and condition (30), with M to be determined, 

g T (s,9)= fu 2 llu< \ | X(du\9)> fu 2 l{u<—} X(du \ 9) > ^lJ£ \ n ± , 
Thus, for s > l/R*, g T (s,9) > t 2 F t (s), where 



c 2 M s 
r T (s) = mm <j 1, — — In 



S"* CT 



and as a result, by (37), M T (l/s) > t 2 F t (s)K u > Cl F T (s)E T , giving A~ 1 M T (l/s)A~ 1 > aF T (s). 
Also, by (37), K* = t 2 A~ x K v A- x > c x I. Thus, there is c 2 = c 2 {b,K v ) > 0, such that for r > 
small enough and s > l/R*, 

qJ1/s)s 2 > c 2 F T (s)s 2 = c 2 min(s 2 ,c 2 Mln— X . 

I CT J 

If M > is large enough so that c 2 c 2 M > 2q + 2h(d) + d, then 

/■oo /"OO f —ror 2 Ml 

/ s 2g+2/i(d) +( i-l e -^(l/s)s 2 ds < / s 2g+2/i(d)+d-l I e -c 2 s 2 + M 2 I dg 

J l/R* " J l/R* I Vcr/ J 

is o(l) as r — ?• 0. It then easily follows that G(<i, g, r) = c(d, q) + o(l). □ 
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6 Proofs of auxiliary results 

6.1 Proposition 1 

Given < a < b < oo, let < h{u) < 1 be a continuous function with compact support in M d \{0}, 
such that h(u) = 1 if ||it|| 6 [a, b]. As pointed out in Section 2.2, 



lim nE[h(U n )f(U n )} = / h(u)f(u) \(du) < oo, 

n— >oo J 

lim nE[h(U n )g{\\U n \\)] = [ h{u)g{\\u\\) \{du) < oo. 

n— >oo J 

From the second line and monotone convergence it follows that 

liminf nE[g(\\U n \\)} > f g(\\u\\) A(du). 

Clearly, / G ^(A) if ff(||u||) G ^(A). Furthermore, 

|nE[/(*7 n )] - nE[/t(C/ n )/(t7 n )]| < nE[(l - h(U n ))g{\\U n \\)] 

<nE\g{\\U n \\)l{\\U n \\ <aor \\U n \\>b}}. 

Therefore, to prove the lemma, it suffices to show 

limsupnE[g(||iy)l{||Ey <a}\ -)• 0, as a -»• 0+, (38) 

n— >oo 

limsupnE[c>(||[/ n ||)l{||iy > 6}] ->■ 0, as 6 ^ oo. (39) 

n— >oo 

Let m(o) = sup 0< || t ||< a 5(t)/t 2 . For n > 1, since V(U n ) = V(X)/n and EU n = EX/n, 

nE[|p(||tT n ||)l{||l7 n || < a}} < m(a)nE[||[/ n || 2 ] 

= m(o)n[tr(V(f/ n )) + ||E^ n || 2 ] 
= m(o)[tr(V(X)) + ||EA:|| 2 /n]. 

Since ?n(a) — > 0, then (38) follows. Next, since g is continuous and non-decreasing, 

/>oo 

nE[ 5 (||C/ n ||)l{||C/ n ||>6}]= / nP{ ff (||ig|)> S ,||ig|>&}d S 

Jo 

/>oo 

= / nF{\\U n \\>g*(s)Vb}ds, (40) 

JO 
where g*(s) = inf{x : g(x) > s}. By Markov inequality, 

?iE||Lg| 2 tr(V(X)) + ||EX|| 2 /n 



nP{||C/ n || >x}< 



x 2 x 2 



Therefore, for large x > 0, nP{||[/ n ,|| > x} < 1. Since cot < 1 — (1 — t/n) n for £ G [0,1], where 
co > is a universal constant, then 

c nE{\\U n \\ >x}< 1-(1-P{||ty >x}) n = p( max \\U n , k \\ > x 1 (41) 

Kk<n 



23 



where U n> k are i.i.d. ~ U n . 

First, suppose X is asymmetric. Let V n i, . . . , V nn ~ U n be another set of i.i.d. random variables 
which are also independent of U n k- Fix 5 G (0, 1). By setting b > even larger, for all x > b, 



max \\V nk \\ < Sx } > 1 - nP{||l/ n || > Sx} > 1/2. 

Kk<n 



As a result, 



c n¥{\\U n \\ > x} < 2p{ max ||C/ nfc || > x) P j max ||Kfc|| < Sx 
< 



2PJ max ||C/ nfc -Kfc|| > (1 - <5)a4 <4P{||X'|| > (1 - 5)x} 

where the last line is due to the symmetry of U n k — V n k and [25], Proposition 2.3. Thus, by (40), 

4 P°° ( IIX'II 1 

nE[ 5 (||^ n ||)l{||C/ n || >b}]<-J PJ^I " 9 * {S) V6 J dS 



^f p MS)^S^ ds 

4 

— E 
co 



s(S)i{IM>(i-W 



If E[g(c||X'||)] < oo for some c > 1, then by choosing 5 > with 1/(1 — 5) < c, it is seen (39) 
holds. Moreover, since g(\\x-y\\) < g(2\\x\\) + g(2\\y\\), E[g(c\\X'\\)] < 2E[g(2c\\X\\)], and hence 
(39) holds once E[ 5 (2c||X||)] < oo. 

Finally, if X is symmetric, then U n is symmetric and from (41) and [25], Proposition 2.3, 
c nP{||C/ n || >x}< 2P{||X|| > x}. Then by similar argument, (39) holds once E[g(||X||)] < oo. 

6.2 Lemmas for univariate case 

Proof of Lemma 1. 1) From the assumption, J \u\ 3 A(d«) < oo for all j > 2. Then by dominated 



rOV-A _ f/1 f„'_1l jtu 



convergence, ^ G C°°(R) with tf jr (t) = /(l {j = 1} - e ltu ){iu) j v{du) for j > 1. By |1- e ia; | < 



X 



,(i), 



for x G M, |*J(*)| < K2,el<|- Clearly, \Vf(t)\ < \k\^ for j > 2. Since ^+z(t) = exp(-^(t) - 

e 2 i 2 /2), then for j > 0, ^} z {t) = Pj(%{t), . . . , ^f{t), t) ^(t) exp(-e 2 £ 2 /2), where i>-(z) is a 

multivariate polynomial in z = (zi, . . . , £j+i) of order j. It follows that | wfLz (*) I = 0(|ipe _e * ' 2 ) 

and hence for any p > 0, |£| p |w+zWI -^ as |i| — > oo, which yields the proof. 
2) For |£| > 1/r, 

Re[^x r (i)] = / (l-costn)A(du) > / (1 - cosftx) A(cht). 

Then by 1 — cosx > C±x 2 /2 for |x| < 1, 

Re[*x r (<)] > ^tt- / n 2 A(dn) > L(t,r). 



2 -'«<i/|t| 
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On the other hand, by Cauchy-Schwartz inequality, 



|t|>l/r 



\t\ q \^x r (t)\dt< 



(1/ 



1 + t 2 



1/2 



1/2 



|t|>l/r 



(l + 0^lfe(i)rdi 



<^ff(l + t 2 )t^e- 2L ^dt 



1/2 



Then by (31), |*| 9 |V>x r (<)| G ^(R) and the proof follows from Proposition 28.1 of [31]. 



□ 



To prove Lemma 2, we need a type of inequalities that are known (cf. [5], Lemma 11.6). Since 
the expression of (/)"' becomes complicated rapidly as j increases, the following specific form is 
used to reduce the maximum order of derivative involved. 



Lemma 7. Let f E y(R) and ip(t) = f. Then for j > 1, 

\f U) (x)\dx 



< 



V2 



(li^wi 2 ^) +j (yv~v(t)i 2 cu) + 



|*V(*)I d* 



1/2' 



Proof. By Cauchy-Schwartz and Minkowski inequalities 

dx ^ ' 



\f U) (x)\dx< 



1 + x 2 



\f^{x)\ 2 {l + x 2 )dx 



1/2 



< v 7 ^ 

1 

"72 



|/ W) (s)| 2 dx 



|*ty(t)| 2 d* 



1/2 



+ 



|x/^')(x)| 2 dz 



1/2' 



1/2 



+ 



l(*V(*))Td< 



1/2' 



where the last line follows from Plancherel theorem and the fact that Fourier transforms of f^'(x) 
and xif(x) are (—itytp(t) and (— i)^ (£)> respectively ([18], p. 100-102). The proof is complete 
by applying Minkowski inequality to the last integral. □ 

Proof of Lemma 2. We only consider the case where sppt(A) C R+. The proof for the symmetric 
case is similar. For brevity, write / = ft, tp = tp^, and *$> = *$>£. By Lemma 1, /, tp € J»^(R). Write 
M = e 2 + B<r(r) 2 . Then 



Re[*(i)] = Re^wW")] = / ( x ~ costu/u)[A\ r {du) + Bj r (du)\ + 



Mf 
~2^ 



If |i| < v/r, then \tu\/v < 1 for < u < r. Since 1 — cosx > C 2 x /2 for |x| < 1, 

-2,2 ,-r m 2 



Re[*(t)] > ^- [" u 2 [AX r (du) + J3 7r (du)] + 



2i/ 2 



^C 2 K 2 , Xr t 2 J3C7 2 m(r),(r)^ 3 t 2 r/'M „ + , _ u 



2i/2 



+ 



2i/ 2 



/ +z e- u dn + 



Mt 2 
~2^' 
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Since s(r) < r/(p + 3), 

rr/s(r) fP+3 

/ u p+2 e~ u du> u p+2 e~ u du > C 2 T{p + 3) . 

Jo Jo 

Then by T{p + 3)m(r)s(r) p+3 = n 2 ,Y r i 

Re[*(t)] > ^ + ^2 + ^T 

ACl^Xrt 2 BCfC 2 K 2iYr t 2 Ba(r)H 2 D{r)H 2 
2v 2 2v 2 2i/ 2 2v 2 ' 

If |t| > v/r, then r > v/\b\ and 

/■ 

i<i//|t| 
Therefore, for j > 0, 



Re[*(t)] > ^P / « 2 A(d«) + ^^ = H(t/u, r). 

lv Ju<u/\t\ Zv 



f\t^(t)\ 2 dt = 2 r&e-^Wdt 

pvlr poo 

< 2 / t 2 Je -D(r)V/^ dt + g / #l e -2HW>,r) d t 

JO Jv/r 

/*GO /'OO 

< 2 / f 2; e -D(r)^/- 2 dt + 2^i+i / i?i e -mt,r) dt 

Jo Jl/r 

v 2 J +1 T(j + 1/2) 2 ;+i f 00 2j 2H (tjP) d _ , , 2 

" D(r) 2 ^ 1 + i 1/r dt-^trj 

Next, V'(*) = -W(t)^(t), with 

tf'(i) = - /(l - e itu /> L4A r (du) + B 7r (d«)] + ^ 
v J v l 

By |1 - e ix \ < \x\ for all x £R, 



,2 ■ 



|*'(t)| < ^ /^ 2 [AA r (du) + B 7r (d«)] + 



As a result, 



Aft 

An 2 ,x r t BK 2 , Yr t (e 2 + Ba(r) 2 )t 

9 9 9 

yZ y£ y£ 

(AK 2}Xr +BK 2}Tr )t e 2 t 2 2 
5 1 k- = (1 + e jv )t. 



12 dt 



\tH'{t)\ 2 dt = J\t^'{t)mt 

< (1 + e 2 /^ 2 ) 2 / \t j+1 m\ 2 dt < 2(1 + e 2 /u 2 ) 2 I j+1 {r) 2 . (43) 

The proof is complete by combining Lemma 7, (42) and (43). □ 
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6.3 Lemmas for multivariate case 

In this section, we prove Lemmas 4-6. Recall that X* = A~ 1 X T and Y* = A~ x Y r have Levy mea- 
sures X*(du,d6) = \ T (dv,du)) and 7*(du, d9) = j T (dv,du)), respectively, where (v,u) = J^ (u,9), 
i.e. v = \\A T 6\\u, oj = A T 0/\\A T 6\\. Note that 

vuj = uA T 9, iiAr 1 ^!! 



\\A T 9\\ 
Then 

X*(du, 6.9) = 1 {v < r T {uj)} \(dv | w) i/(dw) = 1 {u < r*(9)} A*(du | 9) v*(dO), 
where 

r*(9) = r T (uj)/\\A T 9\\ = \\A~ l io\\r T {uj), \*{du \ 9) = \{dv | w), i/*(d0) = i/(dw). 
Similarly, 

7 *(du, d0) = m T (w)l {v > 0} B PrM e -«/'rM dt> i/(dw) 

= m T (w)l{u > 0} p r 0f-^ +1 n^^e-" A ^ll n /^^dnz/(dw) 
= m*(0)l {u > 0} u p *^e- u/s * {e) 6uu*(69), 

where 

m*(0) = m r (u)\\A T 9\\ Pr(/j)+1 , p*{d) = Pt (uj), s*(9) = s T {uo)/\\A T 9\\ = P~ 1 o;||s r (w). 

Therefore, 

7*(dn, d0) = 7*(dn | 9)u*(9), with 7 *(du | 9) = m*(9)l {u > 0} u p ' {e) e- u/s ' {e) du. 

Lemma 8. If (24) holds under v, then 

■ r ./m o [p*(0) + 3]s*(0) _ , , 

ess imp (9) > —2, ess sup — < 1, under v . 

r*(0) 

Furthermore, for R* and S* in (25) defined under u, we have 

R* = ess sup r*(9), S* = esssup[p*(0) + h(d) + l]s*(0), under v* . 

Finally, for g T {a) is defined in (23), we have 

Qr(z) = smallest eigenvalue of C 1 M*(z) and K* 

with M*(z) = f u 2 99'l{u<r*{9) A z} A*(du, d0). 



Proof. The lemma is straightforward except for the assertion on g T (a). By change of variable 

(u,9) = J At (v,u), 

A T M*(z)A T = I \uA T 9){uA T 9)'l {u < r*{9) A z} X*(6u, 69) 

(vw)(vu)'l{\\A^ 1 u\\v < (WA^uWrricj)) A z) \ T (dv,du). 
Since the right hand side is M T (z), M*(z) = A~ 1 M(z)A~ 1 . Then the assertion on g T {z) follows. □ 
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Proof of Lemma 4- Fix a > 3. Since the Levy measure A* of X* has bounded support according 
to condition (25) and Lemma 8, E||X*|| a < oo. 



ii 



a j*(du,d8) = / m*(0)i/(d0) / u p ' {e)+a e - u/s ' {e) du 

m*(8)r{p*(6) + a + l)s*{9) p *^ +a+1 v{d6). 



Letting 



b = ess sup 



r(p*(9) + a + l)s*(8) 
T(p*(6) + 3) 



*(Q\a-2 



then, by the construction of Y* , 

u a j*(du,d6) < b f m*{e)T(p*(9) + 2,)s*{9f^ +z u(de) 

= b I v(dd) fu 2 X* T (du\9) = b I u 2 \*(du,d8). 



Since b < esssup[(p*(#) + a)s* 



a-2 



and by conditions (24) and (25) as well as Lemma 8, 



esssup[(p*((9) + a)s*(<9)] < ess sup < oo, 

p*{p) + 3 

then b < oo. Thus f u a ^*(du,d6) < oo, giving E||y*|| a < oo. 

Lemma 9. Given r > 0, under conditions (24) and (25) , ^>x* and \Py* G C°°(IR d ) suc/i i/iai 

\d j m x .(t)\<d\\t\\, \d j * T *(t)\ < d\\t\\, j = i,...,d 

and for \a\ > 2, 

p*{9) + \a\ 



a 



\d a V x *(t)\ < (R*) la{ - 2 d, \d a ^ T *(t)\ < [S* ess sup 



p*(0) + h(d) + l 



d. 



In particular, if 2 < \a\ < h{d), then \d a ^ T *(t)\ < (S*)\ a \~ 2 d. 

Proof. By Lemma 4, f \\u\\ a X*(du,d8) < oo for a > 2, so by dominated convergence, ^x* £ 
C 00 ^), dj^ x *(t) = fm6j(l - e iu ^)X*(du,d9), and d a ^ x *(t) = - f(iu9) a e [u ^ X*(du,d9) for 
\a\ > 2. For $y*, similar formulas hold. From \dj^ x *(t)\ < \\t\\ f u 2 \*(du,d6) = ||t||tr(V(X*)), 
the first inequality follows. Likewise, \dj^ Y *(t)\ < ||t||tr(V(F*)). Together with \\dj(t'K*t)\ < 
\\K*t\\ < ll^l|t r (^*)) this implies the second inequality. Next, for \a\ > 2, from 

\d a ^ x *(t)\ < I v) al l{u<r*(9)}\*(du,de) 

< (R*)^' 2 fu 2 X*{du,d9) = (R*)^- 2 tr{Y(X*)), 
the third inequality follows. Finally, for |a| > 2, let 



C = ess sup 



«*(*) 



| tt[ - 2 r(p*(g) + |a| + l) 

T{p*(9) + 3) 



2n 



Then by Lemma 8, 

//"CO 
u*{d9) u H 7*(du|0) 

m*(9)[s*(9)] p *^ + ^ +1 F(p*{9) + \a\ + 1) u*{dO) 

< C f m*{9)[s*(9)f^ +3 T( y p*( y 9) + 3)u*(d9) 

= C I u 2 7 *(dn, 60) = Ctr(V(y*)). 

Since \a\ > 2, then by (25) and Lemma 8, 

S * eSS SUP p*(0) + h(d) + l ) 
Also, \d a ^ z *(t)\ < tr(K*)l {\a\ = 2}. We therefore get the last inequality. □ 

Lemma 10. Let C\ and C2 be the same constants defined in Section 3.2. Given r > 0, under 
conditions (24) and (25), the following statements are true. 

1) If\\t\\ < 1/R*, then 

Re[*x*(£)]>^|^, Re[* T »(t)]>^f^. 

2) If\\t\\ >1/R*, then 

Re[ *,. (()1 > «W, ^ T . {t)] > *jp. 

Proof. 1) Given \\t\\ < 1/R*, by Lemma 8, for v*-a.e. 9 G S and < u < r*(6), \{t,9)u\ < 
\(t,9)\r*{9) < 1, yielding 1 - cos((t,9)u) > Cf(t,9) 2 u 2 /2. Therefore, 

Re[*x*(t)] = /[I - cos((i, 9)u)]l{u < r*(9)} X*(du \ 9) v*(d9) 

> 0- f(t, 9) 2 u 2 l {u < r*{9)} \*{du I 9) u*{d9) = Q- I ' (t, 9) 2 u 2 A*(du, d9). 

Since the last integral equals t'Y(X*)t = \\t\\ 2 , we get the first inequality in 1). Next, by (25) and 
Lemma 8, for u*-a.e. 9 G S and < u < s* (9) \p* (9) + 3] , \(t,9)u\ < \\t\\u < r*{9)/R* < 1. Then 

/• S *(0)[p*(0)+3] 

l-cos{{t,9)u)]j*(du\9)> / [l-cos({t,9)u)]j*(du\9) 

Jo 

>^~ / (t,9) 2 u 2 7 *(du\9) 

1 Jo 



>^1 / (t,9) 2 u 2 j*(du\ 



Then 



Re[*y.(t)] = [[l-cos((t,9)u)]j*(du\9)v*(d9) 

> ^M f(t,9) 2 u 2 1 *(du,d9) = ^t'Y(Y*)t 
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and hence 

Re[* T .(t)] = Re[^y.(t)] +Re[tf z .(i)] 

> 9l^l t 'Y(Y*)t + -t'K*t > 9l^l t '[Y(Y*) + K*]t. 

Since V(Y~*) + K* = V(T*) = I, then the second inequality in 1) follows. 
2) The first inequality follows from 

Re[*x-(*)] > f[l - cos((t,0)u)]l \u< r*{6) A -^-\ X*(du,d9) 



2 J { \\t\\) K ' 2 

and Lemma 8, while the second one from Re[*r*(t)] > Re[^^*(i)] = t'K*t/2. D 

Proof of Lemma 5. The proof for 1) is completely similar to that for 1) of Lemma 1, except that 
it is based on Lemma 9. To prove 2), let k = h(d). By Cauchy-Schwartz inequality, 

V2 ft M \ 1/2 



J ||i|HVx* (t)\ dt < (| ||i|| 2 «(i + ||t|| 2fc )|^ (t)\ 2 dtj (| - 



dt 



+ t 



2/v 



The second factor on the right hand side is finite. By 2) of Lemma 10, for ||t|| > 1/R*, \ipx*(t)\ ^ 
exp{-Re[# x *(*)]} < exp{-£ T (l/||t||)||i|| 2 /2}. Then 



[ ||t|| 2 "(l + ||i|| 2fc )|Vx.(t)| 2 dt< [ ||t|| 29 (l + ||t|| 2fc )e-^ (1/l|t|l)l|t||2 dt 

J\\t\\>l/R* J\\t\\>l/R* 



where c(d) is a universal constant. Then by condition (26), ||t|| ? | , 0x*(i)| £ L 1 (M ) and the proof 
follows from Proposition 28.1 of [31]. □ 

To prove Lemma 6, we use the following variant of Lemma 11.6 of [5] which involves lower order 
of partial derivatives. 

Lemma 11. For m > 1, there is a constant ci(d,m), such that for f G ^(R ), 



max 

|o|=m , 



f \f {a \x)\dx< ci(d,m)maxJ J f \t^J{t)\ 2 dt : |/9| < m, 1 < i < d, < j <h{d) 
Proof. Denote k = h(d) and w(x) = x 2k + • • • + x 2fc . By Cauchy-Schwartz inequality, 



\f ia \x)\dx < J / 1+ d ^ ) A / / \f^(x)\Hl + w(x))dx. 



First, 

<d[-^j:[ dx 2 ---dx k =d2 d I =c'(d). 



dx , /" dxi /" , , , j f°° x d 1 dx 

n r^ ^ d TT^k / dx 2 ---dx k = d2 d — — ^ 

1 + W(x) J 1 + Xf 7o<|xi|<|xi| Vo 1 + 3T* 
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Next, by Plancherel theorem and properties of the Fourier transform ([18], p. 100-102), 

d 



\f( a )(x)\ 2 (l + w{x))dx = f\f^(x)\ 2 dx + ^2 f\xif {a \x)\ 2 dx 

1=1 

\ty(t)\ 2 dt + ^T [\d*[t a f(t)]\ 2 at, 

so by \fa + b < yfa + yb for a, b > 0, 



|/H(x)|2(l + w(x)) dx < J I \t"f(t)\ 2 M + itJj \ d ti ta f(tW dt. 



2=1 

Since for each i = 1, . . . , d, 



5f[t"/(t)]= E (*W ■ #''/(*) 

E (-)A^rv"( t ), 



0<j<c«iAfc 

by Minkowski inequality, the desired inequality follows. □ 

Finally, notice that for k > 1, there is a unique multivariate polynomial of x = (xi, . . . ,x^) G C fe , 

Pk{x) = 2. a iX l , with a; G Z + , 

where I& = {i = («i, • • • ,ik) € Z+ : X)j=i J*i = ^}> sucn that for any fc-times differentiable function 
^ on R, letting ^ = exp(^), 

4 k) = Pk(V, *", • ■ • , # W )</> = 2 aiiw-ifcC*')* 1 ^')* 3 • • • (^ (fc) ) ife ^. (44) 

Note that if \£(i) = -£ 2 /2, then (-l) fc P fc (^'(t), . . . , *( fc )(i)) is the fcth-order Hermite polynomial. 
Proof of Lemma 6. Denote k = h(d). By Lemma 5, /e G J^(M ). By Lemma 9, for i = 1, . . . , d 

|$tf { (t)| < A|$¥ x .(*)| + S|^* T .(t)| + e 2 ||i|| < (d + e 2 )||i|| 
and for 2 < j < k, 

\8{9 ( (t)\ < A\di* x *(t)\+B\di* T *(t)\+e 2 l{j = 2} 

< (R*y~ 2 Ad + (S*) j ~ 2 Bd + e 2 l {j = 2} 

< Ad + Bd + e 2 l {j = 2} < d + e 2 . 

Thus, by (44), |5^(t)| < (d + e 2 )•? ^- ( ||* 1 1 , 1, 1, ... , l)|^(t)|. Since Pj(x, 1, . . . , 1) is a jth-order 
polynomial of x E C with coefficients only depending on j, there is a polynomial h(x) = hd, m {x) of 
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order no greater than 2m + Ik with coefficients only depending on (d, m), such that for all j3 with 
\/3\ < m, i = 1, . . . , d and < j < k, 



\t^d{^(t)\ 2 dt<(d + e 2 ) 2k j h(\\t\\m®\ a &- 
Write 

h = Jl{\\t\\< 1/R*} fc(||t||)|^(t)| 2 di, I 2 = Jl{\\t\\> 1/R*} h(\\t\\)\Mt)\ 2 ^ 
By 1) of Lemma 10, for ||t|| < 1/R*, 

Re[* f (t)] > ARe[Vx*(t)] + BRe[® T *(t)} > C 2 C 2 \\t\\ 2 /2. 
Then 

h < //t(p||)exp{-2Re[%(t)]}dt< fh(\\t\\)exp{-CfC 2 \\tf}dt = c!(d,m). 

On the other hand, by 2) of Lemma 10, for ||t|| > 1/R* , 

Re[^(t)] > ARe[V x *(t)} + BRe[* T * (t)} > Q T (l/\\t\\)\\t\\ 2 /2. 
Therefore, by change of variable t = suj with s > and uj £ S, 

h < U{\\t\\ > i/jp}^(iitii)«p(-ft.(i/iitii)iitn a )dt 



= c"(d) / s d ' 1 h(s)e- s ^ 1/s)s ds. 

Jl/R* 

As a result, for all /3 with |/3| < m, i = 1, . . . , d and < j < k, 



oo 



l*^Vf (*)| 2 dt < (1 + e 2 /d) 2k d 2k c'(d, m) + c"(d) / g^^We-"^' ds . 
Combining the bound with Lemma 11, the proof is complete. □ 
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